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Abstract. In this paper we study the role of cliquewidth in succinct 
representation of Boolean functions. Our main statement is the following: 
Let Z be a Boolean circuit having cliquewidth k. Then there is another 
circuit Z* computing the same function as Z having treewidth at most 
18fc + 2 and which has at most 4|Z| gates where \Z\ is the number of gates 
of Z. In this sense, cliquewidth is not more 'powerful' than treewidth for 
the purpose of representation of Boolean functions. We believe this is 
quite a surprising fact because it contrasts the situation with graphs 
where an upper bound on the treewidth implies an upper bound on the 
cUquewidth but not vice versa. 

We demonstrate the usefulness of the new theorem for knowledge com- 
pilation. In particular, we show that a circuit Z of cliquewidth k can 
be compiled into a Decomposable Negation Normal Form (dnnf) of size 
0(9^*''fc2|Z|) and the same runtime. To the best of our knowledge, this 
is the first result on efficient knowledge compilation parameterized by 
cliquewidth of a Boolean circuit. 



1 Introduction 

Cliquewidth is a graph parameter, probably best known for its role in the 
design of fixed-parameter algorithms for graph-theoretic problems In this 
context the most interesting property of cliquewidth is that it is 'stronger' 
than treewidth in the following sense: if all graphs in some (infinite) class have 
treewidth bounded by some constant c, then the cliquewidth of the graphs of 
this class is also bounded by a constant 0(2^). However, the opposite is not true. 
Consider, for example, the class of all complete graphs. The treewidth of this 
class is unbounded while the cliquewidth of any complete graph is 2. 

In this paper we essentially show that, roughly speaking, cliquewidth of a 
Boolean function is not a stronger parameter than its treewidth. In particular, 
given a Boolean circuit Z, we define its cliquewidth as the cliquewidth of the 
DAG of this circuit and the treewidth as the treewidth of the undirected graph 
underlying this DAG. The main theorem of this paper states that for any circuit 
Z of cliquewidth k there is another circuit Z* computing the same function 
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whose treewidth is at most 18fc + 2 and the number of gates is at most 4 times 
the number of gates of Z . Moreover, if Z is accompanied with the respective 
chque decomposition then such a circuit Z* (and the tree decomposition of width 
18fc + 2) can be obtained in time 0{k^n). The definition of circuit treewidth is 
taken from [14] and the definition of circuit chquewidth naturahy fohows from 
the treewidth definition. In fact, the relationship between circuit treewidth and 
chquewidth is put in [14 as an open question. 

We demonstrate that the main theorem is useful for knowledge compilation, 
that is, compact representation of Boolean functions that allows to efficiently 
answer certain queries regarding the considered function. In particular, we show 
that any circuit Z of chquewidth k can be compiled into decomposable nega- 
tion normal form (dnnf) [3] of size 0{9^''k'^\Z\) (where |Z| is the number of 
gates) by an algorithm taking the same runtime. To the best of our knowledge, 
this is the first result on space-efficient knowledge compilation parameterized by 
chquewidth. We believe this result is interesting because the parameterization 
by cliqewidth, compared to treewidth, allows to capture a wider class of inputs 
including those circuits whose underlying graphs are dense. 

This bound is obtained as an immediate corollary of the main theorem and 
the 0{9'^t'^\Z\) bound on the dnnf size for the given circuit Z, where t is the 
treewidth of Z. The intermediate step for the latter result is an 0(3p(|C| -|- 
n)) bound of the dnnf size of the given CNF where C and n are, respectively 
the number of clauses and variables of this CNF and p is the treewidth of its 
incidence graph. All these 3 bounds significantly extend the currently existing 
bound 0{2^n) of [3] where r is the treewidth of the primal graph of the given 
CNF. For example, if the given CNF has large clauses (and hence a large treewidth 
of the primal graph) then the 0(2''7i) bound becomes practically infeasible while 
the 0(3P(C + n)) bound may be still feasible provided a small treewidth of the 
incidence graph and a number of clauses polynomially dependent on n. 

2 Related Work 

The algorithmic power of chquewidth stems from the meta-theorem of ^ stat- 
ing that any problem definable in Monadic Second Order Logic (MSOi) can be 
solved in linear time for a class of graphs of fixed chquewidth k. The chquewidth 
of the given graph is NP-hard to compute [S] and it is not known to be FPT. On 
the other hand, chquewidth is FPT approximable by an FPT computable pa- 
rameter called rankwidth |13lll| . As said above, there are classes of graphs with 
unrestricted treewidth and bounded chquewidth. However, it has been shown in 
|10j that the only reason for treewidth to be much larger than chquewidth is the 
presence of a large complete bipartite graph (biclique) in the considered graph. 
In fact, we prove the main theorem of this paper by applying a transformation 
that eliminates all bicliques from the DAG of the given circuit. 

DNNFs have been introduced as a knowledge compilation formalism in j^, 
where it has been shown that any CNF on n variables of treewidth t of the primary 
graph can be compiled into a dnnf of size 0{2*tn) with the same runtime. A 



detailed analysis of special cases of dnnf has been provided in [B]. In particular, it 
has been shown that Free Binary Decision Diagrams (fbdd) and hence Ordered 
Binary Decision Diagrams (obdd) can be seen as special cases of dnnf. In fact, 
there is a separation between dnnf and fbdd This additional expression 
power of DNNF has its disadvantages: a number of queries that can be answered 
in polynomial time (polytime) for fbdd and obdd are NP-complete for dnnf [B]. 
This trade-off led to investigation of subclasses of dnnf that, on one hand, retain 
the succinctness of dnnf for CNFs of small treewidth and, on the other hand, have 
an increased set of queries that can be answered in polytime. Probably the most 
notable result obtained in this direction are Sentential Decision Diagrams (sdd) 
[5] that, on one hand, can answer in polytime the equivalence query (possibility 
to answer this query in polytime for OBDDs is probably the main reason why this 
formalism is very popular in the area of verification) and, on the other hand, 
retains the same upper bound dependence on treewidth as dnnf. 

In fact the size of obdd can also be efficiently parameterized by the treewidth 
of the initial representation of the considered function. Indeed, there is an obdd 
of size 0{n2P) where p is the pathwidth of the primal graph of the given CNF 
and of size (n*^*^*') where t is the treewidth of the graph, see e.g. [9 . It is shown 
in jl4] that similar pattern retains if we consider the pathwidth and treewidth 
of a circuit but in the former case p is replaced by an exponential function of p 
and in the latter case, t is replaced by a double exponential function of t. 

3 Preliminaries 

A labeled graph G = {V,E,S), in addition to the usual set V{G) of vertices 
and a set E{G) of edges, contains a component S(G), which is a partition of 
V{G). Each element of the partition class is called a label. A simplified clique 
decomposition (scd) is a pair (T, G) where T is a rooted tree and G is a family of 
labeled graphs. Each node t of T is associated with a graph G{t), which is defined 
as follows. If t is a leaf node, then G{t) = {{v}, {{«}}). Assume that t has two 
children h and and let Gi = G(ti) and G2 = G{t2). Then V{Gl)C^V{G2) = 
and G(t) = {V{Gi) U V[G2),E{Gi) U £;(G2),S(Gi) U S(G2)). Finally, assume 
that t has only one child ti and let Gi = G(ti). Graph G{t) can be obtained 
from Gi by one of the following three operations: 

- Adding a new vertex. There is v ^ y{Gi) such that G{t) = (V(Gi) U 
W,i?(Gi),S(Gi)U{M}). 

- Union oflabels. There are 5i, 5*2 £ S(Gi) such that G(t) = (V{Gi), E{Gi), 
(S(Gi) \ {S*!, 52}) U {S}). We say that Si and 5*2 are children of S. 

- New adjacency. There are 5i, 5*2 G S(Gi) such that G{t) = {V{Gi), E{Gi)U 
{{u,v}\u G S'i,w G 52},S(Gi)). We say that Si and S2 are adjacent. 

The width of a node t of T is |S(Gt)|. The width of (T, G) is the largest 
width of a node t of T. Let r be the root of T. Then we say that (T, G) is an SCD 
of G(r) and of {V{G{r)), E{G{r))) (the unlabeled version of G(r). The simplified 
cliquewidth (scw) of a graph G is the smallest width among all SCDs of G. The 



definition of SCD is closely related to the standard notion of clique decomposition. 
In fact sew of a graph G is at most twice larger than the cliquewidth of G. The 
details of comparison are postponed to the appendix. 

Clique decomposition and SCD are easily extended to the directed case. In 
fact the notion of cliquewidth has been initially proposed for the directed case, 
as noted in f7^. The only change is that the new adjacency operation adds to 
G{t) all possible directed arcs from label 5*1 to label 52 instead of undirected 
edges. In this case we say that there is an arc from 5*1 to 52- 

We denote Utev(T) S(G'(t)) by S = S(T, G) and call it the set of labels of 
(T,G). 

A tree decomposition of a graph G is a pair (T, B) where T is a tree and the 
elements of B are subsets of vertices called hags. There is a mapping between the 
nodes of T and elements of B. Let us say a vertex w of G is contained in a node 
t of T if w belongs to the bag B{t) of t. Two properties of a tree decomposition 
are connectedness (all the nodes containing the given vertex v form a subtree of 
T), adjacency (each edge {it,w} is a subset of some bag), and union (the union 
of all bags is V{G). In this paper we consider the treewidth of a directed graph 
as the treewidth of the underlying undirected graph. 

Boolean circuits considered in this paper are over the basis {V,A, ->}. In 
such a circuit there are input gates (having only output wires) corresponding to 
variables and constants true and false. The output of each gate of a circuit Z 
computes a function on the set of input variables. We denote by functions{Z) 
the set of all functions computed by the gates of Z. The number of gates of Z 
is denoted by \Z\. 

A clique or tree decomposition of a circuit Z is the respective decomposition 
of the DAG of Z. In our discussion, we often associate the vertices of the DAG 
with the respective gates. De Morgan circuits are a subclass of circuits where the 
inputs of all the not gates are variables (i.e. the outputs of not gates serve as 
negative literals). For a gate g of Z, denote by Var{g) the set of variables having 
a path to g in the DAG of Z. A circuit Z has the decompos ability property if 
for any two in-neighbors gi and g2 of an and gate g, Var{gi) n Var{g2) = 0. 
DNNF is a decomposable De Morgan circuit. When we consider a general circuit 
Z , we assume that it does not have constant input gates, since these gates can 
be propagated by removal of some gates of Z , which in turn does not increase 
the cliquewidth nor the treewidth of the circuit. However, for convenience of 
reasoning, we may use constant input gates when we describe construction of a 
DNNF. If the given circuit Z is a CNF then its variables-clauses relation can be 
represented by the incidence graphs a bipartite graph with parts corresponding 
to variables and clauses and a variable-clause edge representing occurrence of a 
variable in a clause. 

4 Prom small cliquewidth to small treewidth 

The central result of this section is the following theorem: 



Theorem 1. Let F be a circuit of cliquewidth k over n variables Then there is a 
circuit F* oftreewidth at most 18fc+2 and \F*\ < 4|_F| such that functions{F) C 
functions{F*). Moreover, given F and a clique decomposition of F of width k 
there is an 0{k^n) algorithm constructing F* and a tree decomposition of F* of 
width at most 18k + 2 having at most 2\F\ bags. 

The rest of this seetion is the proof of Theorem [TJ The main idea of the 
proof is to replace 'parts' of the given circuit forming large bicliques by circuits 
computing equivalent functions where such bicliques do not occur. As an example 
consider a CNF of 3 clauses Ci = (ai V 02 V 03 V bi), C2 = {a2 V 02 V 03 V 62) 
and C3 = (ai V 02 V 03 V &3). The circuit of this graph contains a biclique of 
order 3 created by Ci,C2,C3 on one side and 01,02,03 on the other one. This 
biclique can be eliminated by the introduction of additional OR gate C4 having 
input 01,02,03 and output C4 so that the clauses Ci,C2,C3 are transformed 
into (61 V C4), (&2 V C4), (63 V C4), respectively. It is not hard to see that the new 
circuit computes the same function as the original one. This is the main idea 
behind the construction of circuit F* . The formal description of the construction 
is given below. 

For the purpose of construction of F* we consider a type respecting SCD 
(T, G) of F where each non-singleton label is one of the following: 

— A unary label containing input gates and negation gates. 

— An AND label containing and gates. 

— An OR label containing OR gates. 

The following lemma essentially follows from splitting each label of the given 
clique decomposition into three type respecting labels. 

Lemma 1. Let k be the cliquewidth of F and let k* be the smallest width of an 
SCD of F that respects types. Then k* < 6fc. 

Proof. Let (T*, G*) be an SCD of Z having width at most 2k (existing since 
the cliquewidth is k). In each graph G' G G* split each label into at most 3 
subsets so that each subset contains one type of the gates as specified above. 
Clearly, the resulting number of labels is at most 3 times larger than the original 
one. The resulting structure is not necessarily an SCD. In particular, there may 
be situation when the graph associated with a node is the same as the graph 
associated with the parent node because the union operation in the parent has 
been reversed by the splitting. Also, the new adjacency operation may become 
applied between more than one pair of labels. However, a legal SCD is easy to 
recover: the 'redundant' parent nodes can be removed (since they are unary this 
will no cause problems with the structure of the binary tree) and each node with 
a multiple adjacency operation can be replaced by a sequence of nodes applying 
these operations one by one. I 

Given a type respecting SCD (T, G), let us construct the circuit F* . In the 
first stage, we associate each label S £S with a set of gates as follows: 

— If 5 is non-singleton then it is associated with an and gate denoted by 
oand{S) and an OR gate denoted by oor{S). 



— If S" is non-singleton and does not contain input gates then it is associated 
with an additional gate called in[S) whose type is determined as follows: 
If S is an and or OR label then in{S) is an and or OR gate, respectively. 
If S* is a unary label then in(S) is a circuit (perceived as a single atomic 
gate) consisting of two not gates, the output of one of them is the input 
of the other. So, the input of the former and the output of the latter are, 
respectively, the input and output of in{S). 

— Each singleton label {g} is associated with the gate g of F. We call the gates 
associated with singleton labels original gates because they are the gates 
of F* appearing in F. For the sake of uniformity, for each original gate g 
associated with label S, we put g — oand{S) — oor{S) ~ in{S). 

The wires of F* are described below. When we say that there is a wire from 
gate gi to gate g2, we mean that the wire is from the output of gi to the input 

of 52- 

— Child-parent wires. Let Si and S2 be labels of (T, G) such that Si is a 
child of 5*2. Then there is a wire from oand{Si) to oand{S2) and a wire from 
oar (Si) to oar (32)- 

— Parent-child wires. Let 5*1 and S2 be as above and assume that 5*2 does 
not contain input gates. Then there is a wire from in{S2) to in{Si). That is, 
the direction of child-parent wires is opposite to the direction of parent-child 
wires. H 

— Adjacency wires. Assume that in (T, G) there is an arc from Si to 5*2 
(established by the new adjacency node). Then the following cases apply: 

• If 52 is an and label then put a wire from oand{Si) to in{S2)- 

• If ^2 is an OR label then put a wire from oor{Si) to in{S2)- 

• If 5*2 is a unary label consisting of negation gates only then put a wire 
from an arbitrary one of oand{Si) or oor{Si) to in{S2)- 

Finally, we remove in(S) gates that have no inputs. This removal may be 
iterative as removal of one gate may leave without input another one. 

It is not hard to see by construction that F and F* have the same input gates. 
This gives us possibility to state the following theorem with proof in Section HTTl 

Theorem 2. F* is a well formed circuit. The output of each original gate g of 
F* computes exactly the same function (in terms of input gates) as in F. 

In Section 1321 we prove that the treewidth of F* is not much larger than the 
width of (T,G). 

Theorem 3. There is a tree decomposition of F* with at most 2\F\ bags having 
width at most 3k + 2, where k is the width of (T, G). 
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Now we are ready to prove Theorem [TJ 

Proof of Theorem [1] Due to Theorem [5J functions{F) C functions{F*). 
If we take (T, G) to be of the smallest possible type respectmg width then the 
treewidth of F* is at most 18fc + 2 by combination of Theorem [3] and Lemma [TJ 

To compute the number of gates of F* , let n be the number of gates of F, 
which is also the number of singleton labels of (T, G). Since each non-singleton 
label has two children (i.e. in the respective tree of labels each non-leaf node is 
binary), the number of non-singleton labels is at most n — I. By construction, 
F* has one gate per singleton label plus at most 3 gates per non-singleton label, 
which adds up to at most An. 

The technical details of the runtime derivation are postponed to the ap- 
pendix. ■ 

4.1 Proof of Theorem [2] 

We start with establishing simple combinatorial properties of F* (Lemmas I2l3i 
I4I5I) . A path in a circuit is a sequence of gates so that the output of every gate 
(except the last one) is connected by a wire to the input of its successor. Let us 
call a path a connecting path if it contains exactly one adjacency circuit. 

Lemma 2. — Any path P of F* starting at an original gate and not containing 
adjacency wires contains child-parent wires only. 
— Any path P of F* ending at an original gate and not containing adjacency 
wires contains parent-child wires only. 

Proof. The only possible wire to leave the original gate is a child-parent 
wire. Any path starting from an original gate and containing child-parent wires 
only ends up in an oand or oar gate. This means that the next wire (if not an 
adjacency one) can be only another child-parent wire. Thus the correctness of 
the lemma for all the paths of length i implies its correctness for all such paths 
of length i 1, confirming the first statement. 

For the second statement, we start from an original gate and go back against 
the direction of wires. The reasoning similar to the previous paragraph applies 
with the in gates of non-singleton labels replacing the oar and oand ones. H 

Lemma 3. Let gi and 52 be gates of F such that g2 is an AND or an OR gate. 
Then there is a wire from gi to 52 in F if and only if F* has a connecting path 
from gi to 52 such that all the gates of this path except possibly gi are of the 
same type as (72- 

Proof. We prove only the case where 32 is an and gate, the other case is 
symmetric. Let P be a connecting path of F* from gi to 52 of the specified kind. 
Let g[ and ^2 be, respectively, the tail and the head gates of the adjacency wire. 
Then either gi = g[ or the suffix of P ending at g'l consists of child-parent wires 
only according to Lemma [H It follows that g[ corresponds to a label containing 
gi. Analogously, we conclude that either 52 = 92 or the suffix of P starting 
at 92 contains only parent-child labels and hence the label corresponding to 52 



contains 32- Existence of the adjacency wire from the label of to the label of 
(^2 means that the SCD introduces all wires from the gates in the label of g[ to 
the gates in the label of 52- In particular, there is a wire from gi to 32 in i^. 

Conversely, assume that there is a wire from gi to 32 in F. Then there are 
labels 5*1 and 6*2 containing 171 and 52, respectively, such that (T, G) introduces 
an adjacency arc from 5*1 to 82- By construction of F* there is a gate g'l corre- 
sponding to Si and a gate S'2 corresponding to 52 such that F* has an adjacency 
wire from g'l to (72- Moreover, by the definition of a type respecting SCD, ^2 is 
an AND label, hence g'2 = in{S2) is an and gate. Furthermore, by construction 
of F* either 52 = 92 or there is a path from 32 to 52 consisting of parent-child 
arcs only and and gates only. Indeed, if S'2 is not a singleton then there is a 
wire from in{S2) to in{S^) containing 52 since ^3 is the parent of S'2. Iterative 
application of this argument produces a path from ^2 to 52 ■ Since 52 is an AND 
gate, all gates in this path are AND gates by construction. Thus the suffix exists. 
What about the prefix? By construction, g'^ — oand{Si). Since Si contains gi, 
either g'^ — gi or there is a path from gi to g'l involving child-parent wires and 
AND gates only: just start at gi and go every time to the oand-gate of the parent 
until Si has been reached. Thus we have established existence of the desired 
prefix. 

It remains to be shown that the prefix and suffix do not intersect. However, 
this is impossible due to the disjointness of Si and S'2. B 

Lemma 4. Let gi and 52 be the gates of F such that 52 *s « NOT gate. Then F 
has a wire from gi to 52 if md only if there is a connecting path P in F* from gi 
to 52 with the adjacency wire {g'i,g2) such that gi = g'l and all the intermediate 
vertices in the suffix of P starting from g[ are in-gates of unary labels containing 
negation gates only. 

Proof. Let P be a connecting path of F* of the specified form. Then either 
g'2 — g2 or g'2 corresponds to a label containing g2. In both cases this means that 
F has a wire from gi to (72- 

Conversely, assume that F has a wire from gi to 52 • Then there are labels Si 
and S2 containing gi and 52 such that (T, G) sets an adjacency wire from Si to 
S'2. Observe that S'l cannot contain more than one element because in this case 
(72, a NOT gate, will have two inputs. Furthermore, either S'2 contains g2 only or 
S'2 is a unary label containing negation gates only (because the input gates do 
not have input wires). In the latter case, the desired suffix from the head of the 
adjacency arc to .92 follows by construction. ■ 

Lemma 5. Any path of F* between two original gates that does not involve 
other original gates is a connecting path. 

Proof. First of all, let us show that any path of F* between original gates 
involves at least one adjacency wire. Indeed, by Lemma O any path leaving an 
original gate and not having adjacency wires has only child-parent wires. Such 
wires lead only to bigger and bigger labels and cannot end up with a singleton 
gate. It follows that at least one adjacency wire is needed. 



Let us show that additional adjacency wires cannot occur without original 
gates as intermediate vertices. Indeed, the head of the first adjacency wire is 
an in gate of some label S. Unless S" is a singleton, the only wires leaving 
in{S) are parent-child wires to the in gates of the children of S. Applying this 
argumentation iteratively, we observe that no other wires except parent-child 
wires are possible until the path meets the in gate of a singleton label. However, 
this is an original gate that cannot be an intermediate node in our path. It 
follows that any path between two original gates without other original cannot 
involve 2 adjacency wires. Combining with the previous paragraph, it follows 
that any such path involves exactly one adjacency wire, i.e. it is a connecting 
path. ■ 

Using the lemmas above, it can be shown that any cycle in F* involves at 
least one original gate and that this implies that F contains a cycle as well, a 
contradiction showing that F* is acyclic. The technical details of this derivation 
are provided in the lemma below. By construction, each wire connects output to 
input and there are no gates (except the input gates of course) having no input. 
It follows that F* is a well formed circuit. 

Lemma 6. F* has no cycles. 

Proof. Observe first that if F* has a cycle involving at least two original 
gates gi and 172 then we can conclude existence of such cycle in F, which will 
supply us a desired contradiction. Indeed, let gi, . . . ,gr be all the original gates 
of the cycle. Then, according to Lemma[5]there is a connecting path between any 
two consecutive singleton gates and also between gr and gi. Applying Lemmas 
[31 and m depending on the nature of the specific gates, we observe that in F 
there are wires from each to gi-)_i (treating r -t- 1 = 1) that is, F has a cycle, 
a contradiction. 

Furthermore, let us observe that existence of one original gate in a cycle 
implies existence of another one. Indeed, following the argumentation in the 
proof of Lemma [5l we observe that to arrive from a singleton gate to a singleton 
gate (even to itself) one has to go through an adjacency wire. However, the 
label on the other side of the adjacency wire is disjoint with the label of the 
tail side and thus when we start to descend through in{S) gates we eventually 
(without closing the cycle before that since we have not arrived yet at the initial 
original gate!) will encounter another original gate, different from the starting 
one. Similar argumentation means that any in-gate in a cycle imply the presence 
of a singleton gate eventually. This rules out adjacency and parent-child arc from 
a potential cycle and leaves us only with child-parent arc but they are acyclic 
by construction since they go from a smaller label to a larger one. ■ 

In the rest of the discussion we implicitly assume that F* is well formed 
without explicit reference to Lemma [6] 

For each gate g of F* denote by f{g, F*) the function computed by a subcir- 
cuit of F* rooted by g. We establish properties of these functions from which The- 
orem[2]will follow by induction. In the following we sometimes refer to f{g, F*) 
as the function of g. 



Lemma 7. For each not gate g of F* , f{g,F*) is the negation of f{g' ,F*), 
where g' is the input of g in F. 

Proof. According to Lemma 21 F* has a path from g' to g where all vertices 
except the first one are not gates. Since all of them but the last one are doubled, 
there is an odd number of such not gates. Each not gate has a single input, 
hence the function of each gate of the path (except the first one) is the negation 
of the function of its predecessor. Hence these functions are, alternatively, the 
negation of the function of g' and the function of g' . Since the number of NOT 
gates in the path is odd, the function of g is the negation of the function of g', 
as required. ■ 

In order to establish a similar statement regarding and and OR gates we need 
two auxiliary lemmas. 

Lemma 8. For each label S, f{oand{S), F*) is the conjunction of f{g,F*) of 
all original gates g contained in S. Similarly, f{oor{S),F*) is the disjunction of 
the functions of such gates. 

Proof. We prove the lemma only for the oand gates as for the oor gates 
the proof is symmetric. The proof easily goes by induction. For an original 
gate this is just a conjunction of a single element, namely itself, and this is 
clear by construction. For a larger label S, it follows by construction that 
f{oand{S),F*) = f{oand{Si),F*)Af{oandiS2,F*)), where Si and 5*2 are the 
children of S. For 5*1 and 5*2 the rule holds by the induction assumption. Hence, 
f{oand{S), F*) is the conjunction of all the functions of all the original gates in 
the union of Si and 5*2, the same as f{oand{S), F*) is the conjunction of the 
functions of all the original gates contained in S, as required. I 

Let us call a path of F* semi- connecting if it starts with an adjacency wire 
and the rest of the wires are parent-child ones. 

Lemma 9. Let S be an and label. Then f(in{S),F*) is the conjunction of the 
functions of all gates from which there is a semi- connecting path to in{S). For the 
OR label the statement is analogous with the conjunction replaced by disjunction. 

Proof. We provide the proof only for the and label, for the OR label the proof 
is analogous with the corresponding replacements of and by OR and conjunctions 
by disjunctions. 

The proof is by induction on the decreasing size of labels. For the largest 
AND label S, all the input wires are the adjacency wires. Clearly the considered 
function is the conjunction of the functions of the gates at the tails of these 
adjacency wires. It remains to sec if there are no more gates to arrive at in(S) 
by semi-connected paths. But any such gate, after passing through the adjacency 
wire must meet an ancestor of S and, by the maximality assumption, S has no 
ancestors. 

The same reasoning as above is valid for any label S without ancestors. If S 
has ancestors, then f{in{S), F*) is the conjunction of the functions of the gates 
at the tails of the adjacency wires incident to in{S) and the function of the in 



gate of the parent of S . By the induction assumption, this function is in fact a 
conjunction of the gates at the tails of the adjacency wires incident to in(S) plus 
those connected to in{S) by semi-connected paths through the parent. Since any 
semi-connected path either directly hits in{S) at the head of an adjacency wire 
or approaches it through the parent, the statement is proven. ■ 

Lemma 10. The function of any original AND gate g of F* is the conjunction 
of the functions of the singleton gates whose outputs are the inputs of g in F. 
The same happens for the OR gate and the disjunction. 

Proof. As before, we prove the statement for the and gate, for the OR gate 
it is analogous with the respective substitutions. By construction and Lemma [9l 
f{g, F*) is the conjunction of functions of all oand gates (since there are no other 
ones) connected to g by semi-connected paths. Let us call the labels of these oand 
gates the critical labels. Combining this with Lemma[51 we see that f{g, F*) is in 
fact a conjunction of the functions of all original gates contained in the critical 
labels. It remains to show that these gates are exactly the in-neighbors of g in 
F. Let us take a particular in-neighbor g'. By Lemma [3l there is a connecting 
path from g' to g and by LemmajSJ the tail of the adjacency wire of this path is 
the oand gate of a critical label, so g' is in the required set. Conversely, assume 
that g' is a gate in the required set. Specify a critical label S g' belongs to. 
Clearly, there is a child-parent path from g' to oand{S) which, together with a 
semi-connected path from oand{S) to g, makes a connecting path. The latter 
means that in F there is a wire from g' to g according to Lemma |3l as required. 
■ 

Proof of Theorem [2j Let us order the gates topologically and do induction 
on the topological order. The first gate is an input gate and the function of the 
input is just the corresponding variable both in F and in F* . Otherwise, the 
gate is AND or OR or not gate. In the former two cases, according to Lemma 
[TOl the function of g in F* is the conjunction (or disjunction, in case of or) of 
the functions of its inputs in F, the same relation as in F. The theorem holds 
regarding the inputs by the induction assumption, hence the function of g in F* 
is the same as in F. Regarding the not gate, the argumentation is analogous, 
employing Lemma [71 ■ 

4.2 Proof of Theorem [3] 

Let us define the undirected graph H = H{T, G) called the representation graph 
of (T, G) as follows. The vertices of this graph are the labels of (T, G) and two 
vertices 5*1 and S2 are adjacent if and only if either is a child of S2 (or vice 
versa of course) or Si and ^2 are adjacent in (T, G) (meaning that the new 
adjacency operation is applied on Si and £'2). We call the first type of edges 
child-parent edges and the second type adjacency edges. 

Lemma 11. Let t be the treewidth of H. Then the treewidth of F* is at most 
3t + 2. 



Proof (Sketch). Observe that if we contract the gates in F* of each label 
into a single vertex, eliminate directions and remove multiple occurrences of 
edges, we obtain a graph isomorphic to H. The desired tree decompositom is 
obtained from the tree decomposition of H by replacing the occurrence of each 
vertex of in a bag by the gates corresponding to this vertex. Thus, there is a 
tree decomposition of F* with at most 3{t + 1) elements in each bag, that is the 
treewidth of F* is at most 3t + 2. ■ 

Lemma 12. The treewidth of H is at most k, where k is the width of (T, G). 

Proof. For each node t of T, let S{t) be the set of labels of the graph asso- 
ciated with t. Consider the structure (T, B) where B is a family of subsets of H 
associating with each node t a set B{t) consisting of vertices of H corresponding 
to the elements of S(t). Wc arc going to show that (T, B) is a tree decomposition 
of graph H' obtained from H by removal of all child-parent edges. 

First of all, observe that for each v G V{H), the subgraph T„ of T consisting 
of all nodes containing w is a subtree of T. Let us consider T as a rooted tree with 
the root t being the same as in (T, G) . Let ti and t2 be two nodes containing 
V. Then one of them is an ancestor of the other. Indeed, otherwise ti and t2 
arc nodes of two disjoint subtrees Ti and T2 whose roots t'l and t'2 are children 
of some node t* . By the definition of SCD, G{t[) is disjoint with 6(^2) and 
it is not hard to conclude from the definition that V{G{ti)) C V{G{t[)) and 
V{G{t2)) C V{G{t2)) arc disjoint. Since any label is a subset of the set of vertices 
of the graph it belongs to, S{ti) and S{t2) cannot have a common label and 
hence B{ti) and B{t2) cannot have a joint node. Furthermore, it is not hard to 
observe, if ti is ancestor of t2 and S G S{ti) D S{t2) then S belongs to S{t') of 
all nodes t' in the path between ti and t2- Of course, the same is true regarding 
the node of H corresponding to S. Thus we have shown that if ti and t2 contain 
V they cannot belong to different connected components of T„, confirming the 
connectedness of T„. 

Next, we observe that if vi and V2 are incident to an adjacency edge then 
there is a node t containing both vi and V2. Indeed, let Si and ^2 be the labels 
corresponding to vi and 112, respectively. Let t be the node where the adjacency 
operation regarding and ^2 is applied. Then both Si and ^2 belong to S{t) 
and, consequently, t contains both vi and V2- Finally, by construction, each 
vertex of H is contained in some node. 

To obtain the desired tree decomposition of H, we are going to modify (T, B) 
to acqiiirc two properties: that the number of nodes of the resulting tree is at 
most 2\F\ and that each parent-child pair u,v is contained in some node t. For 
the former just iteratively remove all nodes whose operations are new adjacency. 
If the node t being removed is not the root then make the parent of t to be the 
parent of the only child of t (since t has only one child the tree remains binary). 
The latter property can be established by adding at most one vertex to each 
bag of the resulting structure (T',B'). Indeed, for each non-singleton label S, 
let t{S) be the node where this label is created by the union operation. Then 
both children of S belong to the only child of t{S). Let (T', B*) be obtained from 



(T'j B') as follows. For each non-singleton label S, add the vertex corresponding 
to S to the bag of the child of t{S). Since at most one new label is created per 
node of T' , at most one vertex is added to each bag. It is not hard to see both 
of the modifications preserve properties stated in the previous paragraphs and 
achieve the desired properties regarding the child-parent edges. Since each bag 
of (T, B') contains at most fc -t- 1 elements, we conclude that the treewidth of H 
is at most k. Since the number of bags is at most as the number of labels, we 
conclude that the number of bags is at most 2|F| ■ 

Proof of Theorem [3l Immediately follows from the combination of Lemmas 
[TT] and Lemma [121 ^ 

5 Application to knowledge compilation 

In this section we demonstrate an application of Theorem [T] to knowledge com- 
pilation by showing existence of an algorithm compiling the given circuit Z into 
DNNF. Both the time complexity of the algorithm and the space complexity of 
the resulting dnnf are fixed-parameter linear parameterized by the cliquewidth 
of Z. More precisely, the statement is the following: 

Theorem 4. Given a single- output circuit Z of cliquewidth k, there is a dnnf 
of Z having size 0{9^^k'^\Z\). Moreover, given a clique decomposition of Z of 
width k, there is a 0(9^^'^fc^|Z|) algorithm constructing such a DNNF. 

Theorem 2] is an immediate corollary of Theorem [T] and the following one: 

Theorem 5. Given a single-output circuit Z of treewidth p, there is a dnnf 
of Z having size 0{9Pp'^\Z\). Moreover, such a DNNF can be constructed by an 
algorithm of the same runtime that gets as input the circuit Z and a tree decom- 
position of Z of width p having 0{Z) bags. 

The rest of this section is a proof of Theorem [5j Our first step is Tseitin 
transformation from circuit Z into a CNF F' . For this purpose we assume that 
Z does not have paths of 2 or more not gates. Depending on whether this path 
is of odd or even length, it can be replaced by a single not gate or by a wire, 
without treewidth increase. In this case the variables yi, ■ ■ ■ ,ym of F' are the 
variables of Z and the outputs of AND and OR gates of Z. Under this assumption, 
it is not hard to see that the inputs of each gate are literals of yi, . . . , Then 
the output x of Z is either yi or -ly^ for some i. Let us call x the output literal. 

The CNF F' is a conjunction of the singleton clause containing the output 
literal and the CNFs associated with each AND and OR gate. Let C be an AND 
gate with inputs ti, . . . ,tr and output z. Then the resulting CNF is {ti V -iz) A 
. . . A (tr V -iz) A (-iti V ... V -i^r V 2;). If C is an OR gate then the resulting CNF 
is (-1^1 V z) A ... A {^tr V z) A (ii V . . . V V -iz). We call the last clause of the 
CNF of C the carrying clause w.r.t. C and the rest are auxiliary ones w.r.t. C 
and the corresponding input. 

To formulate the property of Tseitin transformation that we need for our 
transformation, let us extend the notation. We consider sets of literals that do 



not contain a variable and its negation. For a set S of literals, Var{S) is the 
set of variables of S. The projection Pr{S, V') of to a set V of variables is 
the subset S" of S obtained by the removal of variables that are not in V'. Let 
S be a family of sets of literals over a set V of variables. Then the projection 
Pr(S, V) of S to V"' C y is {Pr{S, V')\S e S}. Denote by Var{Z) and Var{F') 
the sets of variables of Z and F', respectively. Let us say that a set S of literals 
with Var{S) — Var{Z) is a satisfying assignment of Z if Z is true on the truth 
assignment on Var(Z) that assigns all the literals of S to true. For a CNF, the 
definition is analogous. The well known property of Tseitin transformation is the 
following: 

Lemma 13. Let Si and S2 be the sets of satisfying assignments of F' and Z , 
respectively. Then Pr{Si,Var{Z)) = S2. 

Lemma [T3l is useful because of the following nice property of dnnf. 

Lemma 14. (Theorem 9 of fS^). Let Z be a DNNF let V C Var{Z) and let 
Z' he the dnnf obtained from Z by replacing the variables ofVar(Z) \ V' with 
the true constant. Let S and S' be sets of satisfying assignments of Z and Z' , 
respectively. Then S' = Pr{S,V'). 

Thus it follows from Lemmas [T51 and [Til that having compiled F' into a dnnf 
D', a DNNF D oi Z can be obtained by replacing the variables of Var{F')\Var{Z) 
with the true constant. Clearly, this does not incur any additional gates. In order 
to obtain a DNNF of F' , we observe that the treewidth of the incidence graph of 
F' is not much larger than the treewidth of Z. 

Lemma 15. Let (T, B) be a tree decompositoion of Z of width p. There is a 
0(p^|T|) time algorithm (\T\ is the number of nodes ofT) transforming (T, B) 
into a tree decomposition (T*,B*) of the incidence graph G' of F' having width 
at most 2p + 1 and with \T*\ = 0(p'^|r|). 

Proof. Let F" be the CNF obtained from F' by removal of all the clauses 
but the carrying ones and let G" be the respective incidence graph. Transform 
(r,B) into (T,B") as follows: 

— Replace each occurrence of an and or OR gate X with the respective carrying 
clause and the variable corresponding to the output of X . 

— Replace each occurrence of a not gate with the variable corresponding to 
the input of the gate (it may either be an input variable of Z or the output 
variable of some and or OR gate). 

Let us show that (T, B") is indeed a tree decomposition of G" of width 2p + 1. 

Each element of a bag of B is replaced by at most 2 elements, hence the size 
of a bag is at most twice the maximal size of bag of B, i.e. at most 2{p + 1). 
Consequently the width of (T, B") is at most 2p+ 1. Let us verify the connect- 
edness property. An original variable a; of is contained in a node t of (T, B") if 
and only if in (T, B) t contains either x or the NOT gate Y with input x. By the 



connectedness property both nodes of (T, B) containing x and those containing 
Y form subtrees and by the adjacency property, these subtrees have at least one 
joint vertex. It follows that their union forms a subtree of T. Each new variable 
y corresponding to a gate C of Z is contained in exactly those nodes of (T, B") 
that contain C or the negation of C in (T, B). Exactly the same argument as in 
the previous case ensures connectedness regarding y. Finally each carrying clause 
C is contained in exactly those nodes of (T, B") that contain the corresponding 
gate in (T, B). So, the connectedness regarding C follows from the connected- 
ness property of (T, B). Thus we have established the connectedness of (T, B"). 
To establish the adjacency property, let C be a carrying clause corresponding 
to a gate X in C and let u be a variable occurring in C. If v corresponds to 
the output of C then the adjacency follows by construction because v is explic- 
itly put in those clauses where X appears. So, assume that v corresponds to an 
input of X. If V is an original variable then (T, B) has a node t containing a 
literal of v and X. By construction, in (T, B"), t contains v and C. So, assume 
that V is the output variable of some gate X' and let C be the corresponding 
carrying clause of X' . It follows that in (T, B) there is anode t containing both 
X and X' . Consequently, in (T, B") the same node t contains C,C',v. So, the 
adjacency property has been established and we conclude that (T, B") is indeed 
a tree decomposition of G" of width 2p + 1. 

Next, we observe that for each AND or OR gate X oi Z and for each variable 
u of F' corresponding to an input of X and for variable y of F' corresponding to 
the output of X, there is a node t of (T, B") containing both y and u. Indeed, 
let C be the carrying clause corresponding to X. By construction, whenever t 
contains C, t also contains y. By the adjacency property, there is at least one 
t containing C and u. Since this last t contains also y, this is a desired clause. 
Pick one node with the specified property and denote it by t{y, u). Add to T a 
new node t' with t{y, u) being its only neighbor. The bag of t' will contain y, u, 
and C{y, u) the auxiliary clause of X corresponding to the input u. Do so for 
all the auxiliary clauses. Finally, let y be the variable occurring in the output 
clause (the clause containing the output literal). Specify a node t containing 
this variable. Add a new node t' for whom t is the only neighbor and add y 
and the output clause to the bag of t' . Let (r*,B*) be the resulting structure. 
Clearly the connectedness is preserved and the adjacency property is established 
for the clauses of F' that are not included in F" . It follows that (T*,B*) is a 
tree decomposition of G' by construction, its width does not exceed the width 
of (T, B'), i.e. at most 2p + 1 and the additional 0{p'^\Z\) nodes (their number 
is bounded by the number of wires of Z plus 1 for the output clause) are leaves. 
The dc^sired runtime of the transformation from (T, B) to (T*, B*) clearly follows 
from the above description. ■ 

It remains to show that a space-efficient dnnf can be created parameterized 
by the treewidth of the incidence graph. 

Theorem 6. Let F be a cnf and let (T', B') be a tree decomposition of the 
incidence graph of F. Then F has a dnnf of size 0(3*|T'|) where t is the width 



of (T',B'). Moreover, given F and (T',B') such a DNNF can be constructed by 
an algorithm having the same runtime. 

The proof of Theorem [6] is provided in Section 15.11 

Proof of Theorem [5l The construction of a dnnf for Z consists of 4 
stages: transform Z into F' by the Tseitin transformation; transform the tree 
decomposition of Z into a tree decomposition of the incidence graph of F'; 
obtain a dnnf of F' as specified by Theorem [S] and obtain a dnnf of Z as 
specified in Lemma [T4l The correctness of this procedure fohows from the above 
discussion. The time and space complexities easily follow from the combination 
of the complexities of intermediate stages. I 

5.1 Proof of Theorem [6] 

The proof of Theorem IH] is based on the same idea as the proof that a CNF 
with the width of the primary graph at most p has a dnnf of size 0{2Pn) [3]. 
The difference is that we have to take into account that the bags of the tree 
decomposition contain clauses as well as variables. Let us introduce notation. 
Let F be the CNF whose dnnf we are going to construct, G be the incidence 
graph of F and (T, B) be a tree decomposition of G. In what follows we identify 
the vertices of G with the respective variables and clauses. For each node t of T, 
we denote the bag of T by B{t). Recall that for an element a G B{t) (either a 
variable or clause), we say that t contains a. We assume that (T, B) is a minimal 
tree decomposition in the sense that removal of any element from a bag violates 
a tree decomposition property. This assumption is not constraining because such 
tree decomposition is easy to obtain by iterative removal of nodes from the bags 
until no further removal is possible. 

We pick an arbitrary node tr of T and let to be the root and in what follows 
we consider T to be a rooted tree. We assume w.l.o.g. that T has at most 2 
children. Indeed, otherwise, if some node t has children ti, . . . ,tr for r > 2, we 
introduce additional nodes t'2, . . . ,t'j. make the sequence t,t'2, ■ ■ ■ ,t'^ going from 
the parent to a child, ti remains a child of t and for each t[, node ti becomes the 
additional child. The bags of ■ • ■ : are made identical to the bag of t. Such 
transformation increases the number of nodes at most twice and hence proving 
the theorem for such transformed tree preserves the desired asymptotic. 

We consider only sets of literals with at most one literal per variable. For a 
set S of literals, let Var{S) be the set of variables whose literals occur in S. We 
denote by Cl{F) and Var{F) the set of clauses and variables of F. For a node 
t of T we denote by Gl{t) and Var{t) the set of clauses and variables contained 
in t. For a subtree T' of T, Cl(T') and Var{T') denote the set of clauses and 
variables contained in the nodes of T'. For a clause C and a set V of variables, 
we denote by Pr{C, V) the projection of C to F i.e. the clause obtained by the 
removal from G the occurrences of all the variables that are not in V. Recall 
that for a set S of literals, we use Pr{S, V) with the analogous meaning. For a 
CNF F', we denote by Pr{F' , V) the CNF obtained from F' by projecting ah 
of its clauses to V. For a subtree T' of T, we denote Pr{F, Var(T')) by F{T'). 



Let us call two circuits (formulas including CNF are regarded a special cases 
of circuits) equivalent if tlicy have the; same set of variables and the same set 
of satisfying assignments. One way to create a formula equivalent to the given 
CNF is Shannon expansion. Let -F be a CNF and let a; be a variable of F. Then 
F\x denotes the CNF obtained from F by removal of all the clauses containing 
X and removal from of all the occurrences of -^x from the remaining clauses. It is 
known that {F\x)x\/ is equivalent to F. Applying this transformation 

over a set V of variables works as follows. Let 5* be a set of literals such that 
Var{S) = V. Analogously to F\x, F\S is the CNF obtained from F by removal 
of all the clauses containing the occurrences of S and removal of the occurrences 
of the opposite literals from the remaining clauses. Let us call the disjunction 
Vvar(S)=y(-^l'^)'^ the generalized Shannon expansion of F w.r.t. V. Applying 
the Shannon expansion inductively, it is not hard to show that the generalized 
Shannon expansion of F w.r.t. V is equivalent to F. 

Let us extend our notation. We denote by F \ C" the set of clauses obtained 
from F by removal of all the clauses of C. Let t' be the root of T' and let 
C" C Cl{t') and let S* bo a set of literals assigning a set of variables V' C Varit'). 
We denote {Pr{F\C' , Var{T')))\S by F{T', C , S) and call it a residual of F(r') 
(induced by C and S if the context requires mentioning it). When S or C is 
empty, we can use F{T' , C) and F{T' , S) with the obvious meaning. If 5* assigns 
all the variables contained in t', we say that F{T', C", S) is a basic residual (BR). 
Finally, we call the function (/\ S') A F{T', C , S) extended residual (ER) (/\ S' 
means conjunction of all the literals of S' C S) or, in case Var{S) = Var{t'), 
extended basic residual (EBR). 

Lemma 16. Any residual or extended residual of F{T') is equivalent to a dis- 
junction ofEBRs ofF(T'). 

Proof. Let F(T', C", S) be a residual of F{T'). Let Fi be a disjunction over 
allF{T',C',SUSi)A/\Si where Si is a set of literals ofVar{t')\Var{S). Then 
F{T', C, S) is equivalent to Fi because Fi is a generalized Shannon expansion 
of F(T',C", 5) w.r.t Var{Si) U 

Lemma 17. Let ti be a child oft' and let Ti be the subtree rooted by ti. Let C 
be a clause of F containing an occurrence of a variable x £ Var{Ti) \Var{t'). 
Then C is contained in a node of Ti . 

Proof. By the adjacency property, there must be a node t" of T contain- 
ing both C and x. This node cannot be t' by definition. This node cannot be 
anyone outside Ti because otherwise, by the connectedness property, it will be 
required that x is contained in t' in contradiction to our assumption. It remains 
to conclude that t" is a node of Ti . ■ 

Lemma 18. Let ti and t2 be 2 children ofT' and let Ti and T2 be the subtrees 
of T rooted by them. Let C G Cl{ti) n Cl{t2). Then C contains occurrences of 
Var{Ti) \ Var{t') and ofVar{T2) \ Var{t'). 



Proof. Assume that C does not contain occurrences of, say, Var{Ti) \ 
Varit'). We claim that the occurrences of C can be removed from aU the nodes 
of Ti in contradiction to the minimahty of (T, B). This removal clearly does 
not violate the connectedness property because the path between any two nodes 
outside of Ti does not go through Ti. As for adjacency property, let x be any 
variable contained together with C in a node of Ti. If C and x are adjacent then 
X e Var{Ti) n Var{t') and therefore, their adjacency is witnessed by the bag of 
t'. ■ 

Let F{T',C',S) be a basic residual of F(T'). Let C" C Cl(t') be the union 
of C" and the set of clauses of t' satisfied by S. We call the set Pr{{Cl{ti) n 
Cl{t2))\C" ,Var{T'))\S the branching set of F(T', C", 5). 

Lemma 19. Letti andt2 be the children oft' rooting repective subtrees Ti and 
T2. Then Var{F{T',C',S)) C Var(Ti)AVar{T2). Moreover,the set of clauses 
containing occurrences of both Var(Ti) \ Var{T2) and Var(T2) \ Var{Ti) is 
precisely the branching set of F(T' ,C' , S). 

Proof. By construction, Var{F{T', C , S)) C Var{T'). Furthermore, Var{T') = 
{Var{Ti)AVar{T2)) U Var{t'). Since S assigns Var{t'), it follows that 
Var{F{T',C',S)) C Var{Ti)AVar{T2). 

For the second statement, let C be a clause of F{T', C , S) containing entries 
of both Var{Ti) \ Var{T2) and Var{T2) \ Var{Ti). This means that there is a 
clause C""^ of F such that C = Pr{C""',Var{T'))\S. According to Lemma [HI 
C°'~ e Cl{ti) n Clit2). By construction, C°'^ ^ C". It follows that Pr{{Cl{ti) n 
Cl{t2)) \ C", Var{T'))\S in particular contains Pr(C°^ Var{T'))\S = C. 

Conversely, assume that C belongs to the branching set of F{T' ,C' , S). It 
follows that there is a clause S Cl{ti) n Cl{t2) \ C" related to C as defined 
above. By the connectedness property, Cl[ti) n Cl{t2) C Cl{t') that is, C°'^ 
is contained in t' . Consequently, since C°'^ ^ C", we conclude that is not 
satisfied by S. It follows that C = Pr{C°\ Var{T'))\S is a clause of F(r', C", S). 
According to Lemma ITSl contains occurrences of a; £ Var{Ti) \ Var{t') and 
y e Var{T2) \ Var{t'). By definition, both x and y belong to Var{T') \ Var{S) 
and hence they are preserved in C. By the connectednes property, x G Var{Ti) \ 
Var{T2) and y G Var(T2) \ Var{Ti), hence the opposite direction holds. ■ 

Another method of equivalence preserving transformation is clausal expan- 
sion. Let C be a clause of a CNF F and Ii , /2 be a partition of C. Then, it follows 
from De Morgan laws that (F \ {C} A Ji) V \ {C} A h) is equivalent to F. We 
extend this to the generalized clausal expansion. Let C* be a set of clauses of F. 
For each C 6 C*, define a partition /i(C), /2(C). Let I be the set of ah CNFs 
/ whose set of clauses are exactly one of /i(C),/2(C) for each C 6 C*. Then 
\/ j^i{F \ C*) A / is equivalent to F and called a generalized clausal expansion 
of F w.r.t. I. 

Lemma 20. Let T' be a subtree of T with root t' and assume that t' has two 
children ti and t2 and let Ti and T2 be the subtrees of T rooted by ti and t2, 
respectively. Then each basic residual F{T' ,C' , S) of F{T' ,C' , S) is either un- 
satisfiable or can be represented (for some r) as a {Fi V ... V Fr) where each 



Fi is a conjunction of a residual of F{Ti) and a residual of F(T2). Moreover 
the number of such conjunctions of residuals that are needed to represent all the 
basic residuals F{T' , C", S) does not exceed 3'', where p is the width of (T, B). 

Proof. The unsatisfiability clearly follows if F{T', C , S) contains an empty 
clause. Otherwise, according to Lemma [THl all the occurrences of each clause 
of F{T',C',S) belong to Var{Ti)AVar{T2). Let C* be the set of clauses of 
F(T', C' , S) containing occurrences of variables of both Var{Ti) \ Var{T2) and 
of Var{T2) \ Var{Ti). For each C G C*, let h{C),l2{C) be the partition of C 
into literals of variables of Var{Ti) \ Var{T2) and of Var{T2) \ Var{Ti). Let I 
be the set of CNFs obtained from C* by taking exactly one of /i (C) or I2 (C) 
for each C G C*. Let F* be the clausal expansion of F{T', C , S) w.r.t. C* and 
I. We know that F* is equivalent to F{T\ C , S). 

By construction, F* is a disjunction of CNFs. Let F" be one of the disjuncts. 
Again by construction, F" can be reprsented as the conjunction of CNFs F^ 
whose clauses contain occurrences of Var{Ti) \ Var{T2) only and F2, whose 
clauses contain occurrences of Var(T2) \ Var(Ti). Let Ii be the subset of C* 
consisting of the clauses C € C* such that I2 (C) is taken to F" by the clausal 
expansion and let I2 — C* \ Ii, i.e. those clauses C of C* that /i(C) is taken 
to F" . According to Lemma [T9l C* is nothing else than the branching set of 
F{T',C',S). It follows that for each C € C* there is a set CC°'' C {Cl{ti) n 
Cl{t2)) \ C" such that {C} = PriCC"- ,Var{T'))\S. Let /* = [j^^j^CC"- 
and let /I = Uce/2 Further on, let S'l = Pr{S,Var{Ti)) and let S2 = 

Pr{S, Var{T2)). Let Fi ^ F{Ti,{C" nCl{ti))Ul^,Si) and let F2 = i^(r2, (C" n 
Cl{t2))U 12,82) ■ We claim that F{ = Fi and F^ = F2. We wiU prove only the 
former for the latter is symmetric. 

Let C be a clause of F^. This means that there is a clause of F and 
the clause C"* of F{T',C',S) such that C™* = Pr{C'"' ,Var{T'))\S and C = 
fjint if (J^nt ^ c* OTi C = Pr{C"'\V ar{Ti)) otherwise. In any case C = 
Pr(Pr{C°'' ,Var{T'))\S,Var{Ti)). Since C""^ is not satisfied by S, the opera- 
tion 15* applied to any its subset is in fact the projection to Var{T) \ Var{S). 
With this in mind, we can write 

C = Pr{Pr{C°'' ,Var{T')),Var{Ti)\S = Pr(C°^ Var(T')nFar(Ti))|5 ^ Pr{C°'' ,Var{Ti))\S 

According to our assumption, C is not empty. Let cc be a variable occurring in 
C. By construction, x G Var{Ti) \ Var{t'). It follows from Lemma [T71 that C°'' 
is contained in a node of Var{Ti). By definition, is not satisfied by S, and 
does not belong to C", from whence it follows that C ^ C". Also, by definition, 
if C°'' belongs to a branching set then C'"' G I^. It follows that C'"' i C" U 
and that is not satisfied by Si . Together with the fact that is contained 
in a node of Ti, this implies that Pr{C°^, Var{Ti))\S = C is a clause of Fi. 

Conversely, let C be a clause of F^. Then there is a clause of F that is 
contained in some node of Ti such that C = Pr(C""', Var{Ti))\Si. Then C'"' is 
not satisfied by S. Indeed, is not satisfied by Si by definition of Fi. If C""^ 
is satisfied by any element of S' \ S'l this means that C is contained in a node 
outside Ti and hence, by the connectedness property, C is contained in t' . But 



then C G C" in contradiction to the definition of Fi . Taking into account that 
C" C C", C™* = Pr{C'"-,Var{T'))\S is a clause of F{T',C',S). If C*"* e C* 
then, by definition of Fi, G /| and hence Fr{C"''*, Vari) = C is a clause of 
F{ . If C*"* ^ C* then, according to Lemma [T21 all the occurrences of C™* are 
of Far(ri) \ Var{T2) or all the occurrences of C™* are of Var{T2) \ Var{Ti). 
Notice that the latter case causes contradiction. Indeed, since F(T' ,C' , S) does 
not contain empty clauses, C*"* in particular is not empty. Let a; be a variable 
occurring in C™*. By our assumption x G Var{T2). In fact, by construction 
of F{T',C',S), X G Var{T2) \ Var{t'). It follows from Lemma [JT] that C'"' 
is contained in a node of T2. Since is also contained in a node of Ti, it 
follows from the connectedness property that C""" G Cl{ti)r\Cl{t2) ■ Furthermore, 
^ C" by definition of Fi. Consequently, C*"* belongs to the branching set 
of F{T' ,C' , S). By Lemma [TOl C*"* G C* in contradiction to our assumption.lt 
remains to assume that all the entries of C*"* belong to Var{Ti) \ Var{T2). In 
this case C™* = Pr{C^"'* ,Var{Ti)) = C is a clause of Fl as required. 

Now, let us calculate the number of conjunctions Fi AF2 needed to represent 
all the EBRs of T' . Each F1AF2 is unambiguously determined by the set C", the 
partition /*, /| oi Cl{ti)r\Cl{t2)\C" and the assignment of variables contained in 
t' Let pi-,P2 be the number of clauses and variables contained in t' , respectively. 
It follows that there are at most 3^1 2^^ < ^p^+p^ < 3^ choices for Fi A F2, as 
required. ■ 

Lemma 21. Let t' ,T' ,ti,Ti,t2,T2 be as in the statement of Lemma \20\ and let 
D be a DNNF containing gates with outputs computing all the basic residuals 0/ 
F[Ti) and of F{T2). Then a DNNF computing all the basic residuals of F(T') 
can be computed by adding 0(3^) new gates, where p is the width of (T, B). 

Proof. Each basic residual F{T', C, S) is unambiguously defined by the set 
of clauses C and the assignment S to the variables contained in t' . Let pi be the 
number of clauses and P2 be the number of variables. Then the number of choices 
is at most 2^1 * 2^^ = 2^. The DNNF being constructed will have at most 2? 
OR gates whose outputs are the BRs of F(T') and the inputs are conjunctions 
of residuals of F{Ti) and F{T2) as specified in Lemma fIU\ Let Fi A F2 be a 
conjunction of a residual of F(Ti) and a residual of F{T2). The number of such 
conjunctions needed to form inputs of the above OR gates is at most 3^' according 
to Lemma [20l Each Fi is formed as a disjunction of EBRs of F{Ti), according 
to Lemma [TBI contributing another 2 * 3^ to the overall number of gates. 

Let us calculate the number of EBRs of F{Ti) we need in order to compute 
all the required residuals of F{Ti). For F{T2) the calculation will be analogous. 
For each residual F(Ti, Ci, 5'i) participating in a conjunction Fi A F2 as above, 
Var{Si) = Var{ti)r\Var{t') (see the construction in the proof of Lemma[20|). Let 
pi = \Var[Si) \ and p2 = \Cl{ti)\. Then the number of ways to form the residual 
is at most 2^'^"''^^. Applying the generalized Shannon expansion, we observe that 
each EBR participating in the disjunction forming F{Ti^Ci, Si) is in the form 
/\ 53 A F(Ti , Ci , U 5*3) where S'3 is the set of literals of the remaining variables 
contained in ti. It is not hard to see that the number of possible 5*3 is at most 



follows that the number of required EBRs is at most 2^, each of 
them formed as the conjunction of the respective BR, available as one of outputs 
of D and the set of at most p literals requiring 0{p) gates for their computation. 
Thus we conclude that 0{2Pp) gates will be enough for computing of all the 
required EBRs of F{Ti) and F{T2). Summing up numbers of gates considered 
throughout the proof, we conclude that 0(3^) additional gates will be sufficient 
for our purpose. ■ 

Proof of Theorem [6l We order nodes of T so that every child appears before 
its parent. By induction on this order relation, we prove that it is possible to 
construct a DNNF of size 0{'i^'\T\) whose outputs compute aU BRs of FiT') for 
all the subtress T' of T. To make Lemma [H] working for the case where a non- 
leaf node has only one child, we extend T so that such nodes have an additional 
child being a leaf node with the empty bag. 

Let T' be a subtree of T consisting of a single node being a leaf. The only BRs 
of such node are constant true and false functions. Thus the number of BRs 
over all leaf nodes is 0(1), so regarding these nodes the inductive claim holds. 
Applying Lemma [H] inductively, for each non-singleton subtree T', we observe 
that in order to compute basic residuals of F{T') requires at most 3^ additional 
gates, so the claim stands for each non-singleton subtree T' as well and for T 
in particular. It remains to compute F{T). Applying the generalized Shannon 
expansion, we observe that F{T) is a disjunction of at most 2^ EBRs of F(T) 
, however the additional 0(2^) gates preserve the asymptotic. The runtime of 
this construction is discussed in detail in the appendix. ■ 

6 Discussion 

In this paper we presented a theorem that shows that a circuit of treewidth k 
can be transformed into, roughly speaking, an equivalent circuit of treewidth 
9k + 2 with at most 4 times more gates. A consequence of this statement is 
that any space-efficient knowledge compilation parameterized by the treewidth 
of the input circuit can be transformed into a space efficient knowledge compi- 
lation parameterized by the cliquewidth of the input circuit. We elaborated this 
consequence on the example of DNNF. As a result we obtained a theoretically 
efficient but formidably looking space complexity of Therefore, the 

first natural question is how likely it is that this huge exponent base can be 
reduced. 

The next question for further investigation is to check if the proposed upper 
bound can be applied to SDD [S] which is more practical than dnnf in the sense 
that it allows a larger set of queries to be efficiently handled. To answer this 
question positively, it will be sufficient to extend Theorem |6] to the case of SDD, 
the 'upper' levels of the reasoning will be applied analogously to the case of 

DNNF. 

It is important to note that rankwidth is a better parameter for capturing 
dense graphs than cliquewidth in the sense that rankwidth of a graph does not 
exceed its treewidth plus one [12] as well as cliquewidth ^3j, while cliquewidth 



can be exponentially larger than treewidth (and hence rankwidth) yy. Also, 
computing of rankwidth, unlike cliquewidth, is known to be FPT Therefore, 
it is interesting to investigate the relationship between rankwidth and treewidth 
of a Boolean function. For this purpose rankwidth has to be extended to directed 
graphs [15]. It is worth saying that if the question is answered negatively, i.e. 
that treewidth of a circuit can be exponentially larger than its rankwidth, it 
would be an interesting circuit complexity result. 

Finally, recall that all the upper bounds on the dnnf size obtained in this 
paper are polynomial in the size of the circuit that can be much larger than 
the number of variables. On the other hand, the upper bound on the dnnf 
size parameterized by the treewidth of the primal graph of the given CNF is 
polynomial in the number of variables [3J. Can we do the same in the circuit 
case? 
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A Cliquewidth vs. simplified cliquewidth 



To define cliquewidth, we introduce a graph G = {V^ E, L) where L is a function 
from V to the set of natural numbers. To distinguish from the labeled graph in 
the SCD, we can call such graph numerically labeled. 
We introduce the following operations on graphs. 

— Let z be a number that L{u) ^ i for all u G V{G) and let v ^ V{G). The 
operation i(v) adds a new vertex v to the graph and extends L so that the 
corresponding number of w is i. 

— Let i, j be two numbers having non-empty preimages in L. Then rjij adds 
all possible edges between vertices labeled with i and vertices labeled with 
j- 

— The operation pij changes to j all vertices having label i. 

— Let Gi — {Vi,Ei,Li) and G2 = (^2,-^2,-^^2) be two graphs with disjoint 
sets of vertices. Then the result of disjoint union Gi ® G2 is the graph 
G = (Vi U V2, El U E2, Li U La). 

A clique decomposition is a binary rooted tree T every node of which is 
associated with a numerically labeled G{t) graph and the following rules are 
observed. 

— Each leaf node is associated with a single vertex graph. 

— Let t be a node having the only child ti. Then G{t) is obtained from G{ti) 
by one of the first 3 operations in the above list. 

— If t is a binary node with children ti and t2 then G{t) = G{ti) © G{t2)- 

The width of the given clique decomposition is the smallest k such that the 
images of all vertices of all graphs G{t) are members of [1, . . . , A;]. The cliquewidth 
of the given graph G is the smallest k such that there is a clique decomposition 
T with the root r such that G = (yiG{r)), E{G{r))), i.e. the function L(G(r)) 
may be arbitrary. 

For the rest of the discussion we need to choose sutiable terminology. First, 
abusing the notation, we associate the decompositions with their trees, cspccally 
as the function G{f.) allows to obtain the graph associated with a particular node. 
Let G be a numerically labeled graph. Then G' = Lb{G) is a labeled graph such 
that the elements of S(G') are sets of vertices assigned with the same number by 
L. Let us call the number of images of the elements of the numerically labeled 
graph G the width of G. Finally for a rooted tree T we denote by r{T) the root 
of T. 

Lemma 22. For any clique decomposition T there is an SCD such that 
Lb{G{r{T)) = G(r(T*)) and the width of is at most twice larger than the 
width ofT. 

Proof. The proof is by induction on the height of T. If T is a leaf with the 
only node associated with a graph G then we create a single-node SCD associated 
with Lb{G). Otherwise, assume that r(T) has the only child r\ ad let Ti be the 



subtree (and the respective clique decomposition) rooted by ri . By the induction 
assumption, there is satisfying the statement of the lemma. If the operation 
associated with r{T) is i{v) then T" is a tree such that r(r*) has the only one 
child r[ rooting subtree and G(r(T'*)) is obtained from G{r[) by adding 
a new vertex v. If the operation associated with r{T) is rjij then G'(r(T^)) is 
obtained from G{r[) by the adding new edges operation between labels Si and Sj 
consisting of vertices labeled by i and j, respectively, in G{ri). If the operation 
associated with r is pij then two sitations are possible. In the first situation, 
graph G'(ri) has vertices labeled by i and j. Let Si and Sj be the sets of vertices 
of G(ri) labeled by i and j, respectively. Then G(r(r'*)) is obtained from G{r[) 
by the union of labels Si and Sj. Otherwise, T'' — Tf , that is, we even do not 
add a new vertex. A direct inspection shows that the lemma holds in all the 
considered cases. 

Assume now that r{T) is a binary node and let ri and r2 be the children of 
r(r) and let Ti and T2 be the subtrees of T rooted by ri and r2, respectively. By 
the induction assumption, there are trees (and the respective SCDs) Tf and T| 
satisfying the conditions of the lemma. Let G' = Lb{G{r{T{))) U Lh{G{r{Tl))) 
(the union operation applies to he sets of vertices, of edges, and of labels) and let 
G" = Lb{G{r)). By definition of the involved operations, there are Si C S{Gi), 
S2 ^ 8(62) and one-to-one correspondence / from Si to S2 such that 

S(G") = (S(G') \ (Si U S2)) U {5 U f{S)\S e Si} 

Let Si — {Si, . . . , Sx}- Then T**, in addition to Tf and T| contains nodes 
zo ■ ■ ■ , Zx such that the children of zq are r{Tf) and ?'(T|f) and, for for each 
1 < j < a;, the child of Zi is Zi-i. Furthermore, G{zo) — G' and for each 
1 < i < X, G{zi) is obtained from G(zi_i) by the union of labels Si and f{Si). 
Clearly, the widh of G' is at most twice larger than the width of G{r{T)) and 
he width of the rest of the additional nodes of Ts is smaller than the treewidth 
of G". Finally,it is not hard to see that G{zx) — G" . Thus the lemma holds for 
the considered case. ■ 

That the SCW of a graph G is at most larger than the cliquewidth of G 
immediately follows from Lemma 

B Runtime for Theorem [1] 

B.l Data structure for clique decomposition 

The above approach to define the SCD is convenient for our reasoning, how- 
ever the explicit representation (T, G) is too time consuming as input for an 
algorithm. Instead, each node of the tree can be associated with the respec- 
tive operation with pointers to labels required to perform the operation, thus 
requiring a constant memory per node of T . 

It is not hard to see by an inductive argumentation that any two elements 
in S are either disjoint or one is a subset of the other. The labels are naturally 



organized into a binary tree according to the child-parent relation with the sin- 
gleton nodes being leaves. It is thus not hard to see that the number of labels ts 
at most 2n — 1. 

We are going to show that the number of nodesof T is 0{kn), where k i the 
width of (T,G). Let S e S{G{t)) for some node t of T. Then we say that t 
contains S. 

For each binary node t, let us identify one of the subtrees rooted by a child of 
this node as the left subtree and the other one as the right subtree. Then define 
a DAG D on the labels of (T, G) as follows. The pair {Si, S2) is an arc of D if 
one of the following conditions hold. 

— Si is contained in the node where S2 is created as a result of union of labels 
or adding a new vertex operation. 

— Both Si ad S2 are contained in a binary node, so that 5*1 is contained in the 
left subtree, while iS'2 is contained in the right subtree. 

Lemma 23. Labels Si and S2 are contained in the same node of t if and only 
if either {Si,S2) or (5*2, S'l) is an arc of D. 

Proof. By induction on the height of the node of T. For a leaf this is obvious. 
Consider a non-leaf node t. If this node satisfies one of the two conditions above, 
we are done. Otherwise, if t is a unary node then both Si and S2 are contained in 
the only child of t, so the statement holds by the induction assumption. If none 
of the above happens then i is a binary node and both Si and ^2 are contained 
in a node of either in the left subtree or in the right subtree. In any case both 
5*1 and S2 are contained in a node of a smaller height and again the induction 
assumption applies. ■ 

By definition of graph D, the in-degree of each vertex is at most fc — 1. Since 
there are 0{n) labels, it follows that the number of arcs of D is 0{kn). It follows 
from Lemma [23] that the number of pairs of labels contained in the same node 
is 0{kn). Consequently,the number of new adjacency nodes is 0{kn). Since the 
number of the rest ofthe nodes is 0{n), we conclude that the number of nodes 
of T is 0{kn). 

B.2 The procedure 

We are going to demonstrate an 0{kn) time procedure that constructs a mixed 
(having both directed and undirected edges) graph H* whose nodes correspond 
to the labels and two labels will be connected by either directed child-parent 
arcs (going from the child to the parent) or undirected adjacency arcs. The size 
of this graph (the number of vertices plus the number of arcs) will be 0{kn). 
Also, each label will be associated with a type (and, or, or unary). F* can be 
straightforwardly obtained from H* by simply substituting labels with suitable 
gates and the arcs with suitable wires as specified by the description of F*, 
implying the 0{kn) construction time for F* . 

Recall that for algorithmic purposes the SCD is represented as a tree whose 
nodes are associated with operations with pointers to the labels. In the resulting 



graph the vertices will be associated with labels. Each label will be supplied with 
the adjacency list specifying the parent and children and label connected by the 
adjacency arcs (if any). Each label and each arc are the result of some operation. 
Therefore, exploring the tree in a topological order from the leaves to the root, 
we will be able to reconstruct H* . 

We start from the empty graph. If the currently considered operation is 
adding a new vertex (gate) of F then give the corresponding singleton label the 
type of this gate (and, or, or unary). If the operation is union of two labels 5*1 
and ^2 then introduce the child-parent arcs from 5*1 and 5*2 to 5*1 U52. Technically 
this means following the pointer to the label 5*1 U52 and adding pointers to Si and 
5*2 to the adjacency list marking them as children and, similarly, adding Si U 5*2 
as the parent to the adjacency lists of 5*1 and 52. Also, the type of Si U 5*2 is 
as the type of 5*1 (or of 5*2, they are the same by definition of type respecting 
clique decomposition). Accordingly, the adjacency operation results in adding 
the adjacency arc between the respective labels. Notice that the binary node of 
the clique decomposition tree does not introduce any changes: it requires union 
of two disjoint graphs but at the time of exploration of the node, the union has 
been already performed because all the modifications specified above are done 
on the same graph, whose nodes are the set of labels of the SCD. 

It is not hard to see from the description that the above procedure takes 
0(1) time per node of T. Since the number of nodes of the tree is 0{kn), the 
desired bounds follow. 

It is not hard to observe that the graph H defined is Section HT^ is isomorphic 
to H* except that H* assigns types to nodes and directions to edges. We will 
establish the tree decomposition of H as specified in the proof of lemma [12] 
following post-order exploration of T (children before the parents). The elements 
of bags will be represented by pointers to the corresponding labels. The only 
element contained in leaf node t is the vertex corresponding to the singleton 
label of t. Assume that t is not a leaf node. If the operation of t is new adjacency 
then remove t and make the parent of t (if any) to be the parent of the only 
child of t Otherwise, copy to the bag of t all the elements from the bags of the 
children of t. Then, if the operation of T is the union of labels 5*1 and ^2, then 
replace the vertices corresponding to and 5*2 by the vertex corresponding to 
5*1 U 52 . It only remains to replace each label by the respective gates of F* . This 
algorithm spends 0{k) time per node of T. It follows that the overall time is 
0{k^n). 

C Runtime calculation for Theorem [H] 

The desired DNNF is constructed inductively from the leaves to the root. First, 
BRs for the subtrees rooted by leaves are constructed. Construction of BRs for 
a non-singleton subtree (having constructed BRs for the immediate subtrees) is 
done in 2 stages. First, the required residuals of the children are produced. Then 
the BRs of the considered subtree are produced as disjunctions of conjunctions 



of residuals of the children as specified in Lemma [501 Having constructed all the 
basic residuals, the desired output F is constructed as a residual F{T, C, C). 

The difficulty of this construction is finding pointers to the in-ncighbors of the 
gate currently constructed. If implemented straightforwardly, the whole array of 
the currently existing gates may have been searched, making the construction 
runtime quadratic is the size of the dnnf being constructed. We propose a more 
sophisticated procedure based on amortized analysis that makes the runtime 
asymptotically the same as the size of the resulting dnnf. The description of 
the procedure provided below is divided into 3 subsections specifying the data 
structures, computation of residuals of the given subtree having computed all 
the basic residuals (including also computation of the root), and computation 
of the basic residuals. The final calculation of the runtime is given in the last 
subsection. 

C.l Data structures 

The circuit is maintained in the form of adjacency list. Put it differently, there 
are records corresponding to each gate. These records contain the gate and the 
pointers to the records of the other gates who are in and out-neighbors of the 
corresponding gate of the considered record. The records are not located in a ho- 
mogenouos array but rather grouped around the nodes of the tree decomposition 
(r, B). Let us see how to do that. 

An important subset of the gates are those whose output are BRs. Sligtly 
abusing the notation, we call these gats BRs as well. The pointer to each BR 
F(T' , C", S') is conatined in the record associated with the root of T' . At the time 
of construction of the circuit, it is important to very efficiently find the record 
associated with each gate of the DNNF being constructed. For this purpose, each 
BR F{T' , C, S') is associated with the elements of C and S'. In particular, such 
BRs are kept in an array, let us call it BR{T'). The sets of variables and clauses 
of the input CNF are linearly ordered. This linear order is naturally projected to 
the set of clauses and variables contained in the root r' of T'. Denote by Cl{r') 
and Var{r') the set of clauses and variables, respectively. Then the BRs of F{T') 
are put in correpondence with binary vectors indexed by Cl{r') U Var{r'). The 
order of the respective coordinates is exactly as the order of the corresponding 
elements in the above mentioned order of variables and clauses. Let C be a clause 
contained in r' and let a; be a vector corresponding to F{T' ,C' , S'). Then the 
coordinate of C is if and only if C ^ C", i.e. C is not removed. If y i a variable 
contained in r' then the coordinate of F in x is 1 if 1" G S" and otherwise, i.e. 
-lY G 5". The vectors x, considered as binary numbers, serve as array indices. 
This means that BR{T')[x] contains the pointer to the gate whose output is 
F{T' ,C' , S'). Consequently, given x, this pointer can be obtained in 0(1). We 
call X the characteristic vector of F{T' ,C' , S'). 

Assume that r' is not the root and let r* be the parent of r' . Then the storage 
of T' also maintains an array RR{T') of pointers to the residuals F{T' ,C' , S') 
such that C C CT(r') n Cl{r*) and Var{S') C Var{r') n Var{r*). The vectors 
of RR{T') are enumerated analogously to BR{T') but indexed by elements of 



/ = {Cl{r') n Cl{r*)) U {Var{r') n Var{r*)) ordered according to the above 
mentioned order of variables and clauses of F . 

C.2 Construction of residuals given basic residuals 

It follows from Lemma [1^] that each residual is a disjunction of EBRs. We are 
going to show how to construct the circuit computing the residuals provided 
the gates computing the BRs have already been constructed. The first step 
is simple. We go along the array RR(T') and specify in the record of each 
corresponding gate that this gate is a disjunction. Now we are going to create the 
rest of the circuit. The first step is to create a binary vector Pattern indexed 
by Cl{r') U Varir') exactly in the same order of coordinates as for BR{T') 
the 1 entries correspond precisely to the elements of /. We also need a binary 
vector ClVar indexed in the ame way the element equals one if and only if the 
corresponding element of Cl{r') U Val{r') is a clause. Both of these vectors can 
be prepared in a time polynomial in k. Since the whole time of the computation 
of RR{T') is exponential in fc, this runtime may be not taken into account, so 
we do not elaborate on it anymore. Next, we process each element of BR{T'). 
The processing of the given element BR{T')[x\ conists of 3 stages. 

— Redundancy testing. On that stage the algorithm tests if the given BR is 
needed at aU for the forming of the array RR{T'). Let F(T', C", S') be the 
BR of F(T') corresponding to x. Then F{T' , C", S') is redundant if and only 
if C \ Cl{r*) ^ 0. Such element exists if and only if there is a coordinate i 
such that x[i] = Pattern[i] — and ClVar[i] = 1. Clearly,the whole testing 
can be done in 0(fc) per vector x just by straightforward exploration of the 
vectors. 

Construction of the corresponding EBR. We specify S" C S' such 
that Var{S") = Var{S') \ Var{r*). The elements of S" correspond to those 
coordinates i of x where Pattern[i\ = ClVar[i] — 0. Create a new con- 
junction with inputs being precisely elements of S and F(T' ,C' , S'). This 
operation can be performed in 0{k) (we may safely assume that each coor- 
dinate is accompanied with pointers to both of the variables). The output 
of the obtained conjunction is the EBR A S" A F{T', C , S'). 

— Connecting the EBR to the input of the corresponding residual. 
We create a new vector y and copy there the elements of x on coordinates 
i where Pattern[i] = 1. Clearly, y can be created in 0{k). Then we connect 
the output of the conjunction created in the previous item to the input of 
the residual pointedto by RR[T']{y). This can be done in 0(1). 

It follows from Lemma [16] and by construction that each gate pointed to by 
an element of RR{T') is indeed a residual of T' as specified. Notice also that we 
have not applied the reuse of conjnctions of literals as was specified in the proof 
of Lemma (211 however, it does not increase the asymptotic space of 0(2''') nor 
the runtime 0{2''k) spent to the contruction of residuals of T'. In the case of 
root r, we need to have the residual F{T, 0, 0) whose output is the function of 



the CNF F. This can be done according to the same scheme. That is, we explore 
the array BR{T) extracting elements F{T, 0, S) and forming the dijunctions of 
all A'S'Ai^(r,0,S'). 

C.3 Construction of basic residuals 

Wc arc now going to describe the construction of the rest of the DNNF including 
the gates whose outputs are BRs, their in-neighbours and the rest of incident 
arcs. Let T' be a subtree of T having only one node, that is its root is a leaf of T. 
In order to construct BR{T'), we explore all the characteristic vectors x of the 
basic residuals of F{T'). For the given x, BR(T')[x\ points to the true constant 
if the corresponding BR is true constant and to the false constant otherwise (i.e. 
when the corresponding BR is a false constant). In order to keep the complexity 
of this step within the desired boundary, it is essential that the CNF would be 
represented in the form of adjacency matrix that allows 0(1) testing if the given 
literal belongs to a particular clause. In this case, it is not hard to see that the 
complexity ofthis step is 0(2'^A;^). 

Assume that T' contains more than one node. We assume w.l.o.g. that the 
root r' has two children ri and r2 (the reasoning for one child is a restricted 
version of the reasoning for the case of two children). Again, we explore all the 
characteristic vectors of r' . Let x be such vector and let F{T' ,C' , S') be the 
corresponding BR. The first step is to sec if there is C G Cl{r) \ [C' U C/(ri) U 
C/(r2)) such that C is not satisfied by any literal of S' . If such C is found 
then the respective BR is unsatisfiable and all the algorithm has to do is to 
record the pointer to the false constant in BR{T')[x\. Otherwise, F{T' ,C' , S') 
is represented as the disjunction of conjunctions of pairs of residuals of Ti and T2. 
The number of such conjnctions over all the characteristic vectors is 3*^, hence it 
would not be difficult to design a procedure whose runtime is proportional to 3*^ 
multiplied by a polynomial of k. However, we want to get rid of the polynomial 
factor and hence the procedure will be more tricky to enable the amortised 
analysis. 

A standard data structure for amortised analysis is the binary counter. Con- 
sider a binary vector of k elements and let us compute the runtime of 2*^ con- 
secutive increments. Although the runtime of one particular increment can be 
as large as 0{k) due to the carry the overall runtime is 0(2*^), i.e. 0(1) per 
increment. In our construction, we use a refined version of binary counter which 
we call selective counter. In this counter, there arc a number of fixed digits and 
the increment is performed only on the digits that are not fixed. There are a 
few ways how to keep information about non-fixed digits so that the next digit 
can be found in 0(1). For example there may be a pointer to the rightmost 
non-fixed digit and each non-fixed digit can contain a pointer to the next one 
and the last digit also records some bit telling the algorithm about that. Let fci 
be the number of non-fixed digits. Then, it is not hard to sec that 2*^1 increments 
can be performed in 0(2'^^). It is important to notice that if we use decrement 
instead increment then we have the same upperbound on the runtime. 



Back to the DNNF construction, given we introduce two vectors xi and 
X2- The coordinates of xi correspond to {Cl{r') n C/(ri)) U {Var{r') n Var{ri)). 
The element of xi whose coordinate corresponds to a variable v equals 1 if and 
only if w S S' . Otherwise (i.e. if -^v 6 S") the element equals 0. The elements 
corresponding to clauses can be partitioned into the following 3 sets. 

— Elements, whose coordinates correspond to clauses of C", equal 0. 

— Elements, whose coordinates coorespond to clauses of Cl{ri) \ {Cl{r2) U C"), 
are 1. 

— Elements, whose coordinates correspond to clauses of {Cl{ri) n Cl{r2)) \ C", 
are 0. 

The structure of vector X2 is symmetric with the roles of r2 and ri exchanged. 
The only difference is that elements of coordinates as in the last item of the above 
list are 1. 

We treat vectors xi and X2 as selective binary counters with [Cl{ri){^Cl{r2))\ 
C being coordinates of non-fixed digits, the increment operation applied to xi 
and the decrement operation applied to X2- Then the algorithm proceeds as 
follows. 

— We set the gate BR{T')[x\ points to as the OR-gate. 

— We create the data structure with two items whose initial value is {xi^X2) 
perceived as binary vectors as defined above. The only operation of this 
data strcture is the modification applying increment to the first item and 
decrement to the second one. Let k' be the number of non fixed digits in the 
above selective vectors. Then this data structure can be in 2^ possible states 
including (xi, X2) and the states obtained by sequences of modifications. The 
amortised analysis argument shows that all these states can be explored in 
0{2^ ) i.e. the time proportional to the number of states. 

— For each state {x'i,X2) as above, create a conjunction whose inputs will be 
the gates pointed to by RR{Ti)[x'^ and RR{T2)[x'^ and whose output is an 
input of the gate OR as in the first item of this list. 

It follows by construction and from Lemma that the output of the OR gate of 
BR{T')[x\ is indeed F{T', C", S'). The runtime spent to construction of BR[T') 
can be calculated as follows. Checking whether the given BR is a false constant 
takes a polynomial time per x, so the total time is 2^ multiplied by a polynomial 
of k. The same can be said regarding creation of the data structure as in the 
above list. Exploration of the states the data structure over all the vectors x takes 
0(3*^). This follows from Lemma [20l and from the fact that by construction, the 
algorithm spends 0(1) per such state. 

The description of the procedure for creation of the dnnf is now complete. 
Summarising, the runtime calculations we see that it takes 0{i^) time per node 
of the tree decomposition. 



